Dataset statistics
| Number of variables | 26 |
|---|---|
| Number of observations | 300000 |
| Missing cells | 0 |
| Missing cells (%) | 0.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 59.5 MiB |
| Average record size in memory | 208.0 B |
Variable types
| NUM | 16 |
|---|---|
| CAT | 10 |
id has unique values | Unique |
Reproduction
| Analysis started | 2021-02-08 22:16:30.340495 |
|---|---|
| Analysis finished | 2021-02-08 22:18:10.237024 |
| Duration | 1 minute and 39.9 seconds |
| Software version | pandas-profiling v2.9.0 |
| Download configuration | config.yaml |
| Distinct | 300000 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 250018.5769 |
|---|---|
| Minimum | 1 |
| Maximum | 499999 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 24987.95 |
| Q1 | 124772.5 |
| median | 250002.5 |
| Q3 | 375226.5 |
| 95-th percentile | 475032.1 |
| Maximum | 499999 |
| Range | 499998 |
| Interquartile range (IQR) | 250454 |
Descriptive statistics
| Standard deviation | 144450.15 |
|---|---|
| Coefficient of variation (CV) | 0.5777576681 |
| Kurtosis | -1.202249332 |
| Mean | 250018.5769 |
| Median Absolute Deviation (MAD) | 125228 |
| Skewness | 0.0001529463078 |
| Sum | 7.500557308e+10 |
| Variance | 2.086584584e+10 |
| Monotocity | Strictly increasing |
| Value | Count | Frequency (%) | |
| 2049 | 1 | < 0.1% | |
| 158951 | 1 | < 0.1% | |
| 177404 | 1 | < 0.1% | |
| 167163 | 1 | < 0.1% | |
| 165114 | 1 | < 0.1% | |
| 171257 | 1 | < 0.1% | |
| 189686 | 1 | < 0.1% | |
| 195829 | 1 | < 0.1% | |
| 193780 | 1 | < 0.1% | |
| 181490 | 1 | < 0.1% | |
| Other values (299990) | 299990 | > 99.9% |
| Value | Count | Frequency (%) | |
| 1 | 1 | < 0.1% | |
| 2 | 1 | < 0.1% | |
| 3 | 1 | < 0.1% | |
| 4 | 1 | < 0.1% | |
| 6 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 499999 | 1 | < 0.1% | |
| 499998 | 1 | < 0.1% | |
| 499997 | 1 | < 0.1% | |
| 499996 | 1 | < 0.1% | |
| 499993 | 1 | < 0.1% |
cat0
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |
| A | |
|---|---|
| B | 18529 |
| Value | Count | Frequency (%) | |
| A | 281471 | 93.8% | |
| B | 18529 | 6.2% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
cat1
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |
| A | |
|---|---|
| B |
| Value | Count | Frequency (%) | |
| A | 162678 | 54.2% | |
| B | 137322 | 45.8% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
cat2
Categorical
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |
| A | |
|---|---|
| B | 23449 |
| Value | Count | Frequency (%) | |
| A | 276551 | 92.2% | |
| B | 23449 | 7.8% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
cat3
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |
| C | |
|---|---|
| A | |
| D | 11174 |
| B | 610 |
| Value | Count | Frequency (%) | |
| C | 183752 | 61.3% | |
| A | 104464 | 34.8% | |
| D | 11174 | 3.7% | |
| B | 610 | 0.2% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
cat4
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |
| B | |
|---|---|
| A | 1241 |
| C | 767 |
| D | 619 |
| Value | Count | Frequency (%) | |
| B | 297373 | 99.1% | |
| A | 1241 | 0.4% | |
| C | 767 | 0.3% | |
| D | 619 | 0.2% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
cat5
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |
| B | |
|---|---|
| D | |
| C | 11763 |
| A | 3878 |
| Value | Count | Frequency (%) | |
| B | 149208 | 49.7% | |
| D | 135151 | 45.1% | |
| C | 11763 | 3.9% | |
| A | 3878 | 1.3% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
cat6
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |
| A | |
|---|---|
| B | 6344 |
| C | 809 |
| D | 147 |
| I | 24 |
| Other values (3) | 33 |
| Value | Count | Frequency (%) | |
| A | 292643 | 97.5% | |
| B | 6344 | 2.1% | |
| C | 809 | 0.3% | |
| D | 147 | < 0.1% | |
| I | 24 | < 0.1% | |
| E | 19 | < 0.1% | |
| H | 11 | < 0.1% | |
| G | 3 | < 0.1% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
cat7
Categorical
| Distinct | 8 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |
| E | |
|---|---|
| D | 24356 |
| B | 5750 |
| G | 1961 |
| F | 279 |
| Other values (3) | 23 |
| Value | Count | Frequency (%) | |
| E | 267631 | 89.2% | |
| D | 24356 | 8.1% | |
| B | 5750 | 1.9% | |
| G | 1961 | 0.7% | |
| F | 279 | 0.1% | |
| A | 14 | < 0.1% | |
| C | 6 | < 0.1% | |
| I | 3 | < 0.1% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
cat8
Categorical
| Distinct | 7 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |
| C | |
|---|---|
| E | |
| G | |
| A | |
| D | 3694 |
| Other values (2) | 563 |
| Value | Count | Frequency (%) | |
| C | 121054 | 40.4% | |
| E | 94616 | 31.5% | |
| G | 42195 | 14.1% | |
| A | 37878 | 12.6% | |
| D | 3694 | 1.2% | |
| F | 549 | 0.2% | |
| B | 14 | < 0.1% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
cat9
Categorical
| Distinct | 15 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 2.3 MiB |
| F | |
|---|---|
| I | |
| L | |
| H | |
| K | |
| Other values (10) |
| Value | Count | Frequency (%) | |
| F | 107281 | 35.8% | |
| I | 50064 | 16.7% | |
| L | 42200 | 14.1% | |
| H | 24759 | 8.3% | |
| K | 20955 | 7.0% | |
| A | 13408 | 4.5% | |
| G | 10409 | 3.5% | |
| M | 9838 | 3.3% | |
| J | 6981 | 2.3% | |
| O | 6173 | 2.1% | |
| Other values (5) | 7932 | 2.6% |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
cont0
Real number (ℝ)
| Distinct | 299830 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.524633789 |
|---|---|
| Minimum | -0.09350536055 |
| Maximum | 1.052665996 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | -0.09350536055 |
|---|---|
| 5-th percentile | 0.2514193718 |
| Q1 | 0.3704508256 |
| median | 0.4922083191 |
| Q3 | 0.6547925468 |
| 95-th percentile | 0.9326039757 |
| Maximum | 1.052665996 |
| Range | 1.146171357 |
| Interquartile range (IQR) | 0.2843417212 |
Descriptive statistics
| Standard deviation | 0.2048746106 |
|---|---|
| Coefficient of variation (CV) | 0.3905097516 |
| Kurtosis | -0.2241451429 |
| Mean | 0.524633789 |
| Median Absolute Deviation (MAD) | 0.1397477691 |
| Skewness | 0.5098842266 |
| Sum | 157390.1367 |
| Variance | 0.04197360607 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0.4560714025 | 2 | < 0.1% | |
| 0.6801987809 | 2 | < 0.1% | |
| 0.4930380606 | 2 | < 0.1% | |
| 0.4683074375 | 2 | < 0.1% | |
| 0.3461965667 | 2 | < 0.1% | |
| 0.3372972656 | 2 | < 0.1% | |
| 0.3489577861 | 2 | < 0.1% | |
| 0.4988903173 | 2 | < 0.1% | |
| 0.3609564062 | 2 | < 0.1% | |
| 0.508918992 | 2 | < 0.1% | |
| Other values (299820) | 299980 | > 99.9% |
| Value | Count | Frequency (%) | |
| -0.09350536055 | 1 | < 0.1% | |
| -0.08429116477 | 1 | < 0.1% | |
| -0.08151528875 | 1 | < 0.1% | |
| -0.07723720256 | 1 | < 0.1% | |
| -0.07590473341 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1.052665996 | 1 | < 0.1% | |
| 1.047962912 | 1 | < 0.1% | |
| 1.044700298 | 1 | < 0.1% | |
| 1.044469089 | 1 | < 0.1% | |
| 1.04391612 | 1 | < 0.1% |
cont1
Real number (ℝ)
| Distinct | 299642 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5066489299 |
|---|---|
| Minimum | -0.05510510604 |
| Maximum | 0.8517463757 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | -0.05510510604 |
|---|---|
| 5-th percentile | 0.04928001596 |
| Q1 | 0.3523071228 |
| median | 0.6151562832 |
| Q3 | 0.6881497715 |
| 95-th percentile | 0.7756661163 |
| Maximum | 0.8517463757 |
| Range | 0.9068514817 |
| Interquartile range (IQR) | 0.3358426487 |
Descriptive statistics
| Standard deviation | 0.2352694791 |
|---|---|
| Coefficient of variation (CV) | 0.4643639121 |
| Kurtosis | -0.6912706609 |
| Mean | 0.5066489299 |
| Median Absolute Deviation (MAD) | 0.1322055851 |
| Skewness | -0.7264677191 |
| Sum | 151994.679 |
| Variance | 0.05535172781 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0.7609876137 | 2 | < 0.1% | |
| 0.5516212538 | 2 | < 0.1% | |
| 0.7533256983 | 2 | < 0.1% | |
| 0.6708453372 | 2 | < 0.1% | |
| 0.6374363124 | 2 | < 0.1% | |
| 0.6508764411 | 2 | < 0.1% | |
| 0.6204438534 | 2 | < 0.1% | |
| 0.7311381673 | 2 | < 0.1% | |
| 0.6389326584 | 2 | < 0.1% | |
| 0.04342531913 | 2 | < 0.1% | |
| Other values (299632) | 299980 | > 99.9% |
| Value | Count | Frequency (%) | |
| -0.05510510604 | 1 | < 0.1% | |
| -0.04931508474 | 1 | < 0.1% | |
| -0.04883282998 | 1 | < 0.1% | |
| -0.04778759545 | 1 | < 0.1% | |
| -0.04676215247 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0.8517463757 | 1 | < 0.1% | |
| 0.8475761216 | 1 | < 0.1% | |
| 0.8473649654 | 1 | < 0.1% | |
| 0.8472583892 | 1 | < 0.1% | |
| 0.8457408625 | 1 | < 0.1% |
cont2
Real number (ℝ)
| Distinct | 299707 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4441147267 |
|---|---|
| Minimum | -0.06027372205 |
| Maximum | 1.017689219 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | -0.06027372205 |
|---|---|
| 5-th percentile | 0.1279779537 |
| Q1 | 0.3141211439 |
| median | 0.4572706105 |
| Q3 | 0.5548352803 |
| 95-th percentile | 0.7856643395 |
| Maximum | 1.017689219 |
| Range | 1.077962941 |
| Interquartile range (IQR) | 0.2407141364 |
Descriptive statistics
| Standard deviation | 0.2000885238 |
|---|---|
| Coefficient of variation (CV) | 0.4505334135 |
| Kurtosis | -0.1517279878 |
| Mean | 0.4441147267 |
| Median Absolute Deviation (MAD) | 0.1299177548 |
| Skewness | 0.1710232658 |
| Sum | 133234.418 |
| Variance | 0.04003541735 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0.4585841713 | 2 | < 0.1% | |
| 0.1678112063 | 2 | < 0.1% | |
| 0.4073022758 | 2 | < 0.1% | |
| 0.6442625744 | 2 | < 0.1% | |
| 0.391572095 | 2 | < 0.1% | |
| 0.4505376107 | 2 | < 0.1% | |
| 0.3522814386 | 2 | < 0.1% | |
| 0.6608548729 | 2 | < 0.1% | |
| 0.434396014 | 2 | < 0.1% | |
| 0.3933225102 | 2 | < 0.1% | |
| Other values (299697) | 299980 | > 99.9% |
| Value | Count | Frequency (%) | |
| -0.06027372205 | 1 | < 0.1% | |
| -0.05726183114 | 1 | < 0.1% | |
| -0.05584678114 | 1 | < 0.1% | |
| -0.05447625363 | 1 | < 0.1% | |
| -0.05408701575 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1.017689219 | 1 | < 0.1% | |
| 1.014034228 | 1 | < 0.1% | |
| 1.013120756 | 1 | < 0.1% | |
| 1.012765314 | 1 | < 0.1% | |
| 1.004610862 | 1 | < 0.1% |
cont3
Real number (ℝ≥0)
| Distinct | 299796 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4462141003 |
|---|---|
| Minimum | 0.134759848 |
| Maximum | 1.006468677 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0.134759848 |
|---|---|
| 5-th percentile | 0.1697756861 |
| Q1 | 0.2145721353 |
| median | 0.3778233694 |
| Q3 | 0.7197584307 |
| 95-th percentile | 0.8231545671 |
| Maximum | 1.006468677 |
| Range | 0.8717088288 |
| Interquartile range (IQR) | 0.5051862955 |
Descriptive statistics
| Standard deviation | 0.238669041 |
|---|---|
| Coefficient of variation (CV) | 0.5348756142 |
| Kurtosis | -1.362968261 |
| Mean | 0.4462141003 |
| Median Absolute Deviation (MAD) | 0.1839353469 |
| Skewness | 0.4015277675 |
| Sum | 133864.2301 |
| Variance | 0.05696291111 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0.3852205792 | 2 | < 0.1% | |
| 0.7449862479 | 2 | < 0.1% | |
| 0.2149184211 | 2 | < 0.1% | |
| 0.1947428184 | 2 | < 0.1% | |
| 0.3477642227 | 2 | < 0.1% | |
| 0.357932437 | 2 | < 0.1% | |
| 0.1600022898 | 2 | < 0.1% | |
| 0.1660321026 | 2 | < 0.1% | |
| 0.747554062 | 2 | < 0.1% | |
| 0.3498813174 | 2 | < 0.1% | |
| Other values (299786) | 299980 | > 99.9% |
| Value | Count | Frequency (%) | |
| 0.134759848 | 1 | < 0.1% | |
| 0.1356604394 | 1 | < 0.1% | |
| 0.1362060803 | 1 | < 0.1% | |
| 0.1363881483 | 1 | < 0.1% | |
| 0.1364402415 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1.006468677 | 1 | < 0.1% | |
| 1.003940291 | 1 | < 0.1% | |
| 0.9969731804 | 1 | < 0.1% | |
| 0.9910977123 | 1 | < 0.1% | |
| 0.9887089693 | 1 | < 0.1% |
cont4
Real number (ℝ≥0)
| Distinct | 299736 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.455471435 |
|---|---|
| Minimum | 0.1892162896 |
| Maximum | 0.9940500684 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0.1892162896 |
|---|---|
| 5-th percentile | 0.2768757143 |
| Q1 | 0.2798526181 |
| median | 0.4113511369 |
| Q3 | 0.621808131 |
| 95-th percentile | 0.8272805699 |
| Maximum | 0.9940500684 |
| Range | 0.8048337788 |
| Interquartile range (IQR) | 0.3419555129 |
Descriptive statistics
| Standard deviation | 0.2006950827 |
|---|---|
| Coefficient of variation (CV) | 0.4406315463 |
| Kurtosis | -0.8981667078 |
| Mean | 0.455471435 |
| Median Absolute Deviation (MAD) | 0.1319898 |
| Skewness | 0.7436693388 |
| Sum | 136641.4305 |
| Variance | 0.04027851622 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0.2792919534 | 3 | < 0.1% | |
| 0.7196430026 | 2 | < 0.1% | |
| 0.2587691724 | 2 | < 0.1% | |
| 0.2774150506 | 2 | < 0.1% | |
| 0.2783440273 | 2 | < 0.1% | |
| 0.278350752 | 2 | < 0.1% | |
| 0.2802594014 | 2 | < 0.1% | |
| 0.279204459 | 2 | < 0.1% | |
| 0.2799262621 | 2 | < 0.1% | |
| 0.2776613497 | 2 | < 0.1% | |
| Other values (299726) | 299979 | > 99.9% |
| Value | Count | Frequency (%) | |
| 0.1892162896 | 1 | < 0.1% | |
| 0.1915212826 | 1 | < 0.1% | |
| 0.1927385949 | 1 | < 0.1% | |
| 0.1928958691 | 1 | < 0.1% | |
| 0.1944662391 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0.9940500684 | 1 | < 0.1% | |
| 0.9837858239 | 1 | < 0.1% | |
| 0.9746762892 | 1 | < 0.1% | |
| 0.9734029052 | 1 | < 0.1% | |
| 0.9711389496 | 1 | < 0.1% |
cont5
Real number (ℝ)
| Distinct | 299857 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5083365866 |
|---|---|
| Minimum | -0.08724666205 |
| Maximum | 1.044433461 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | -0.08724666205 |
|---|---|
| 5-th percentile | 0.2114987688 |
| Q1 | 0.3387473087 |
| median | 0.4413842656 |
| Q3 | 0.7095145248 |
| 95-th percentile | 0.9226693048 |
| Maximum | 1.044433461 |
| Range | 1.131680123 |
| Interquartile range (IQR) | 0.3707672161 |
Descriptive statistics
| Standard deviation | 0.2316121906 |
|---|---|
| Coefficient of variation (CV) | 0.455627623 |
| Kurtosis | -0.9605123321 |
| Mean | 0.5083365866 |
| Median Absolute Deviation (MAD) | 0.1697970386 |
| Skewness | 0.5111504075 |
| Sum | 152500.976 |
| Variance | 0.05364420686 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0.3860724892 | 2 | < 0.1% | |
| 0.2408908087 | 2 | < 0.1% | |
| 0.2663496701 | 2 | < 0.1% | |
| 0.3778093369 | 2 | < 0.1% | |
| 0.7991562877 | 2 | < 0.1% | |
| 0.443814565 | 2 | < 0.1% | |
| 0.7487363578 | 2 | < 0.1% | |
| 0.2728234387 | 2 | < 0.1% | |
| 0.8664042048 | 2 | < 0.1% | |
| 0.3397759862 | 2 | < 0.1% | |
| Other values (299847) | 299980 | > 99.9% |
| Value | Count | Frequency (%) | |
| -0.08724666205 | 1 | < 0.1% | |
| -0.01892834578 | 1 | < 0.1% | |
| 0.01732678368 | 1 | < 0.1% | |
| 0.02789173931 | 1 | < 0.1% | |
| 0.03055298554 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1.044433461 | 1 | < 0.1% | |
| 1.041097235 | 1 | < 0.1% | |
| 1.040165077 | 1 | < 0.1% | |
| 1.039924191 | 1 | < 0.1% | |
| 1.03979545 | 1 | < 0.1% |
cont6
Real number (ℝ≥0)
| Distinct | 299875 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4783452172 |
|---|---|
| Minimum | 0.04395333548 |
| Maximum | 1.093311508 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0.04395333548 |
|---|---|
| 5-th percentile | 0.2481665982 |
| Q1 | 0.3398959616 |
| median | 0.410089752 |
| Q3 | 0.6042464047 |
| 95-th percentile | 0.8689058501 |
| Maximum | 1.093311508 |
| Range | 1.049358172 |
| Interquartile range (IQR) | 0.2643504431 |
Descriptive statistics
| Standard deviation | 0.1924322851 |
|---|---|
| Coefficient of variation (CV) | 0.4022874656 |
| Kurtosis | -0.1490648539 |
| Mean | 0.4783452172 |
| Median Absolute Deviation (MAD) | 0.09975577419 |
| Skewness | 0.8710045443 |
| Sum | 143503.5652 |
| Variance | 0.03703018436 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0.6098912541 | 2 | < 0.1% | |
| 0.2350868814 | 2 | < 0.1% | |
| 0.3964871376 | 2 | < 0.1% | |
| 0.2466746401 | 2 | < 0.1% | |
| 0.2730613587 | 2 | < 0.1% | |
| 0.3985872577 | 2 | < 0.1% | |
| 0.3932007262 | 2 | < 0.1% | |
| 0.2537843024 | 2 | < 0.1% | |
| 0.9318401104 | 2 | < 0.1% | |
| 0.3650764499 | 2 | < 0.1% | |
| Other values (299865) | 299980 | > 99.9% |
| Value | Count | Frequency (%) | |
| 0.04395333548 | 1 | < 0.1% | |
| 0.05054153902 | 1 | < 0.1% | |
| 0.05191413439 | 1 | < 0.1% | |
| 0.05208155824 | 1 | < 0.1% | |
| 0.05233008675 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1.093311508 | 1 | < 0.1% | |
| 1.086769156 | 1 | < 0.1% | |
| 1.085660826 | 1 | < 0.1% | |
| 1.084925361 | 1 | < 0.1% | |
| 1.084750491 | 1 | < 0.1% |
cont7
Real number (ℝ≥0)
| Distinct | 299832 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.45590421 |
|---|---|
| Minimum | 0.2087026958 |
| Maximum | 1.036540503 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0.2087026958 |
|---|---|
| 5-th percentile | 0.2397147848 |
| Q1 | 0.2780409482 |
| median | 0.3607356108 |
| Q3 | 0.6393880398 |
| 95-th percentile | 0.8533803757 |
| Maximum | 1.036540503 |
| Range | 0.8278378069 |
| Interquartile range (IQR) | 0.3613470916 |
Descriptive statistics
| Standard deviation | 0.2044927142 |
|---|---|
| Coefficient of variation (CV) | 0.4485431581 |
| Kurtosis | -0.7468133166 |
| Mean | 0.45590421 |
| Median Absolute Deviation (MAD) | 0.1122627755 |
| Skewness | 0.7051679981 |
| Sum | 136771.263 |
| Variance | 0.04181727015 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0.2851157547 | 2 | < 0.1% | |
| 0.3439632618 | 2 | < 0.1% | |
| 0.4734162923 | 2 | < 0.1% | |
| 0.6513646586 | 2 | < 0.1% | |
| 0.3006936796 | 2 | < 0.1% | |
| 0.3249387366 | 2 | < 0.1% | |
| 0.7657798896 | 2 | < 0.1% | |
| 0.2321970514 | 2 | < 0.1% | |
| 0.7633394589 | 2 | < 0.1% | |
| 0.6618641241 | 2 | < 0.1% | |
| Other values (299822) | 299980 | > 99.9% |
| Value | Count | Frequency (%) | |
| 0.2087026958 | 1 | < 0.1% | |
| 0.2094665209 | 1 | < 0.1% | |
| 0.2101580642 | 1 | < 0.1% | |
| 0.2105090733 | 1 | < 0.1% | |
| 0.2105537786 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1.036540503 | 1 | < 0.1% | |
| 1.035724038 | 1 | < 0.1% | |
| 1.035470569 | 1 | < 0.1% | |
| 1.03145898 | 1 | < 0.1% | |
| 1.029685806 | 1 | < 0.1% |
cont8
Real number (ℝ≥0)
| Distinct | 299765 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.4593209938 |
|---|---|
| Minimum | 0.004041388119 |
| Maximum | 1.014155557 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0.004041388119 |
|---|---|
| 5-th percentile | 0.1405207448 |
| Q1 | 0.3086545446 |
| median | 0.4258013333 |
| Q3 | 0.5415252359 |
| 95-th percentile | 0.9077543621 |
| Maximum | 1.014155557 |
| Range | 1.010114169 |
| Interquartile range (IQR) | 0.2328706913 |
Descriptive statistics
| Standard deviation | 0.220641617 |
|---|---|
| Coefficient of variation (CV) | 0.4803647558 |
| Kurtosis | -0.09412954961 |
| Mean | 0.4593209938 |
| Median Absolute Deviation (MAD) | 0.1169121001 |
| Skewness | 0.725130309 |
| Sum | 137796.2982 |
| Variance | 0.04868272316 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0.3041154756 | 2 | < 0.1% | |
| 0.3657306513 | 2 | < 0.1% | |
| 0.4590615545 | 2 | < 0.1% | |
| 0.5023262668 | 2 | < 0.1% | |
| 0.8710757732 | 2 | < 0.1% | |
| 0.8761131015 | 2 | < 0.1% | |
| 0.4040838067 | 2 | < 0.1% | |
| 0.6034779781 | 2 | < 0.1% | |
| 0.4162661644 | 2 | < 0.1% | |
| 0.8709318274 | 2 | < 0.1% | |
| Other values (299755) | 299980 | > 99.9% |
| Value | Count | Frequency (%) | |
| 0.004041388119 | 1 | < 0.1% | |
| 0.004490192741 | 1 | < 0.1% | |
| 0.009537106334 | 1 | < 0.1% | |
| 0.009616991211 | 1 | < 0.1% | |
| 0.0102242393 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1.014155557 | 1 | < 0.1% | |
| 1.013401927 | 1 | < 0.1% | |
| 1.012661764 | 1 | < 0.1% | |
| 1.011870334 | 1 | < 0.1% | |
| 1.011292281 | 1 | < 0.1% |
cont9
Real number (ℝ≥0)
| Distinct | 299863 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5268991427 |
|---|---|
| Minimum | 0.07303994469 |
| Maximum | 0.9720914516 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0.07303994469 |
|---|---|
| 5-th percentile | 0.232363823 |
| Q1 | 0.3619570588 |
| median | 0.4888671825 |
| Q3 | 0.7527645954 |
| 95-th percentile | 0.8337855587 |
| Maximum | 0.9720914516 |
| Range | 0.899051507 |
| Interquartile range (IQR) | 0.3908075366 |
Descriptive statistics
| Standard deviation | 0.204025052 |
|---|---|
| Coefficient of variation (CV) | 0.3872184171 |
| Kurtosis | -1.211933365 |
| Mean | 0.5268991427 |
| Median Absolute Deviation (MAD) | 0.1550879577 |
| Skewness | 0.2232278554 |
| Sum | 158069.7428 |
| Variance | 0.04162622184 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0.7830558473 | 2 | < 0.1% | |
| 0.8156356359 | 2 | < 0.1% | |
| 0.6317912834 | 2 | < 0.1% | |
| 0.8207000644 | 2 | < 0.1% | |
| 0.8176908478 | 2 | < 0.1% | |
| 0.8126861353 | 2 | < 0.1% | |
| 0.807500815 | 2 | < 0.1% | |
| 0.196485855 | 2 | < 0.1% | |
| 0.8173360067 | 2 | < 0.1% | |
| 0.3461489301 | 2 | < 0.1% | |
| Other values (299853) | 299980 | > 99.9% |
| Value | Count | Frequency (%) | |
| 0.07303994469 | 1 | < 0.1% | |
| 0.08050657872 | 1 | < 0.1% | |
| 0.08091262586 | 1 | < 0.1% | |
| 0.08267598197 | 1 | < 0.1% | |
| 0.08340700918 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0.9720914516 | 1 | < 0.1% | |
| 0.9690038723 | 1 | < 0.1% | |
| 0.965512222 | 1 | < 0.1% | |
| 0.9634616139 | 1 | < 0.1% | |
| 0.9629578089 | 1 | < 0.1% |
cont10
Real number (ℝ≥0)
| Distinct | 299894 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5049429014 |
|---|---|
| Minimum | 0.05964379697 |
| Maximum | 1.029773341 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0.05964379697 |
|---|---|
| 5-th percentile | 0.1945640674 |
| Q1 | 0.338897848 |
| median | 0.519854825 |
| Q3 | 0.6728090213 |
| 95-th percentile | 0.8382278076 |
| Maximum | 1.029773341 |
| Range | 0.9701295439 |
| Interquartile range (IQR) | 0.3339111733 |
Descriptive statistics
| Standard deviation | 0.201548871 |
|---|---|
| Coefficient of variation (CV) | 0.3991518059 |
| Kurtosis | -0.9044132014 |
| Mean | 0.5049429014 |
| Median Absolute Deviation (MAD) | 0.1685714556 |
| Skewness | 0.08465076136 |
| Sum | 151482.8704 |
| Variance | 0.04062194739 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0.4862561041 | 2 | < 0.1% | |
| 0.2764418087 | 2 | < 0.1% | |
| 0.5756927733 | 2 | < 0.1% | |
| 0.6590147934 | 2 | < 0.1% | |
| 0.2932241967 | 2 | < 0.1% | |
| 0.7408512821 | 2 | < 0.1% | |
| 0.6181024688 | 2 | < 0.1% | |
| 0.7272338414 | 2 | < 0.1% | |
| 0.7436206291 | 2 | < 0.1% | |
| 0.337467558 | 2 | < 0.1% | |
| Other values (299884) | 299980 | > 99.9% |
| Value | Count | Frequency (%) | |
| 0.05964379697 | 1 | < 0.1% | |
| 0.06314982847 | 1 | < 0.1% | |
| 0.06655382025 | 1 | < 0.1% | |
| 0.06978422146 | 1 | < 0.1% | |
| 0.06998373298 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1.029773341 | 1 | < 0.1% | |
| 1.028627392 | 1 | < 0.1% | |
| 1.027378701 | 1 | < 0.1% | |
| 1.025105811 | 1 | < 0.1% | |
| 1.025025636 | 1 | < 0.1% |
cont11
Real number (ℝ≥0)
| Distinct | 299877 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5299381007 |
|---|---|
| Minimum | 0.06416121457 |
| Maximum | 1.038048933 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0.06416121457 |
|---|---|
| 5-th percentile | 0.191424597 |
| Q1 | 0.3166618274 |
| median | 0.5588266091 |
| Q3 | 0.7203810423 |
| 95-th percentile | 0.8677535134 |
| Maximum | 1.038048933 |
| Range | 0.9738877186 |
| Interquartile range (IQR) | 0.4037192148 |
Descriptive statistics
| Standard deviation | 0.2308599637 |
|---|---|
| Coefficient of variation (CV) | 0.4356357156 |
| Kurtosis | -1.383586373 |
| Mean | 0.5299381007 |
| Median Absolute Deviation (MAD) | 0.2033826778 |
| Skewness | -0.03074609761 |
| Sum | 158981.4302 |
| Variance | 0.05329632286 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0.7454032375 | 2 | < 0.1% | |
| 0.7124307057 | 2 | < 0.1% | |
| 0.6997179805 | 2 | < 0.1% | |
| 0.708948735 | 2 | < 0.1% | |
| 0.6799414024 | 2 | < 0.1% | |
| 0.7254231375 | 2 | < 0.1% | |
| 0.4293840502 | 2 | < 0.1% | |
| 0.6578644058 | 2 | < 0.1% | |
| 0.2757554015 | 2 | < 0.1% | |
| 0.2719823734 | 2 | < 0.1% | |
| Other values (299867) | 299980 | > 99.9% |
| Value | Count | Frequency (%) | |
| 0.06416121457 | 1 | < 0.1% | |
| 0.06562327932 | 1 | < 0.1% | |
| 0.06816543333 | 1 | < 0.1% | |
| 0.06915909477 | 1 | < 0.1% | |
| 0.070765679 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 1.038048933 | 1 | < 0.1% | |
| 1.036472705 | 1 | < 0.1% | |
| 1.034723563 | 1 | < 0.1% | |
| 1.033762813 | 1 | < 0.1% | |
| 1.03273021 | 1 | < 0.1% |
cont12
Real number (ℝ)
| Distinct | 299824 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.5245492482 |
|---|---|
| Minimum | -0.005599995244 |
| Maximum | 0.9613704167 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | -0.005599995244 |
|---|---|
| 5-th percentile | 0.2767894312 |
| Q1 | 0.3321430521 |
| median | 0.4073648664 |
| Q3 | 0.7324313196 |
| 95-th percentile | 0.880974641 |
| Maximum | 0.9613704167 |
| Range | 0.9669704119 |
| Interquartile range (IQR) | 0.4002882675 |
Descriptive statistics
| Standard deviation | 0.2208917893 |
|---|---|
| Coefficient of variation (CV) | 0.4211078178 |
| Kurtosis | -1.461532898 |
| Mean | 0.5245492482 |
| Median Absolute Deviation (MAD) | 0.1241142907 |
| Skewness | 0.3728983307 |
| Sum | 157364.7745 |
| Variance | 0.04879318256 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0.33796517 | 2 | < 0.1% | |
| 0.3057334428 | 2 | < 0.1% | |
| 0.3280673977 | 2 | < 0.1% | |
| 0.3585106814 | 2 | < 0.1% | |
| 0.3503079769 | 2 | < 0.1% | |
| 0.3373837636 | 2 | < 0.1% | |
| 0.2691703888 | 2 | < 0.1% | |
| 0.3441662932 | 2 | < 0.1% | |
| 0.3203546768 | 2 | < 0.1% | |
| 0.3533426701 | 2 | < 0.1% | |
| Other values (299814) | 299980 | > 99.9% |
| Value | Count | Frequency (%) | |
| -0.005599995244 | 1 | < 0.1% | |
| 0.0138050169 | 1 | < 0.1% | |
| 0.01535109932 | 1 | < 0.1% | |
| 0.01897772249 | 1 | < 0.1% | |
| 0.02822295157 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0.9613704167 | 1 | < 0.1% | |
| 0.9606612908 | 1 | < 0.1% | |
| 0.9587155008 | 1 | < 0.1% | |
| 0.958533659 | 1 | < 0.1% | |
| 0.9580715793 | 1 | < 0.1% |
cont13
Real number (ℝ≥0)
| Distinct | 299866 |
|---|---|
| Distinct (%) | > 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.503349028 |
|---|---|
| Minimum | 0.1581209416 |
| Maximum | 0.8735786266 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0.1581209416 |
|---|---|
| 5-th percentile | 0.207126659 |
| Q1 | 0.2912885661 |
| median | 0.433908679 |
| Q3 | 0.7308702762 |
| 95-th percentile | 0.8168110085 |
| Maximum | 0.8735786266 |
| Range | 0.7154576849 |
| Interquartile range (IQR) | 0.4395817102 |
Descriptive statistics
| Standard deviation | 0.2252176585 |
|---|---|
| Coefficient of variation (CV) | 0.4474383498 |
| Kurtosis | -1.634448892 |
| Mean | 0.503349028 |
| Median Absolute Deviation (MAD) | 0.2096072261 |
| Skewness | 0.1307868945 |
| Sum | 151004.7084 |
| Variance | 0.05072299368 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 0.6273081492 | 2 | < 0.1% | |
| 0.288095053 | 2 | < 0.1% | |
| 0.3884877383 | 2 | < 0.1% | |
| 0.2686897208 | 2 | < 0.1% | |
| 0.26994353 | 2 | < 0.1% | |
| 0.2682442138 | 2 | < 0.1% | |
| 0.3956211113 | 2 | < 0.1% | |
| 0.4110199937 | 2 | < 0.1% | |
| 0.7911254652 | 2 | < 0.1% | |
| 0.6812815463 | 2 | < 0.1% | |
| Other values (299856) | 299980 | > 99.9% |
| Value | Count | Frequency (%) | |
| 0.1581209416 | 1 | < 0.1% | |
| 0.158265398 | 1 | < 0.1% | |
| 0.1586329232 | 1 | < 0.1% | |
| 0.159824792 | 1 | < 0.1% | |
| 0.1601819379 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 0.8735786266 | 1 | < 0.1% | |
| 0.8664160048 | 1 | < 0.1% | |
| 0.8656272721 | 1 | < 0.1% | |
| 0.8647702666 | 1 | < 0.1% | |
| 0.8640303312 | 1 | < 0.1% |
target
Real number (ℝ≥0)
| Distinct | 299648 |
|---|---|
| Distinct (%) | 99.9% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.456260484 |
|---|---|
| Minimum | 0 |
| Maximum | 10.30920751 |
| Zeros | 1 |
| Zeros (%) | < 0.1% |
| Memory size | 2.3 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 5.986960362 |
| Q1 | 6.798340504 |
| median | 7.496503224 |
| Q3 | 8.161166382 |
| 95-th percentile | 8.769955955 |
| Maximum | 10.30920751 |
| Range | 10.30920751 |
| Interquartile range (IQR) | 1.362825879 |
Descriptive statistics
| Standard deviation | 0.8872947048 |
|---|---|
| Coefficient of variation (CV) | 0.1189999607 |
| Kurtosis | -0.4364989081 |
| Mean | 7.456260484 |
| Median Absolute Deviation (MAD) | 0.6804074946 |
| Skewness | -0.2012646961 |
| Sum | 2236878.145 |
| Variance | 0.7872918932 |
| Monotocity | Not monotonic |
| Value | Count | Frequency (%) | |
| 6.108285511 | 3 | < 0.1% | |
| 6.932713792 | 2 | < 0.1% | |
| 5.625573328 | 2 | < 0.1% | |
| 5.733541388 | 2 | < 0.1% | |
| 7.610031197 | 2 | < 0.1% | |
| 8.050184107 | 2 | < 0.1% | |
| 6.690032424 | 2 | < 0.1% | |
| 8.038191225 | 2 | < 0.1% | |
| 5.892355285 | 2 | < 0.1% | |
| 7.640924121 | 2 | < 0.1% | |
| Other values (299638) | 299979 | > 99.9% |
| Value | Count | Frequency (%) | |
| 0 | 1 | < 0.1% | |
| 2.391029438 | 1 | < 0.1% | |
| 2.648898066 | 1 | < 0.1% | |
| 2.857556711 | 1 | < 0.1% | |
| 3.343645706 | 1 | < 0.1% |
| Value | Count | Frequency (%) | |
| 10.30920751 | 1 | < 0.1% | |
| 10.29096483 | 1 | < 0.1% | |
| 10.28224856 | 1 | < 0.1% | |
| 10.27913759 | 1 | < 0.1% | |
| 10.25730878 | 1 | < 0.1% |
Pearson's r
The Pearson's correlation coefficient (r) is a measure of linear correlation between two variables. It's value lies between -1 and +1, -1 indicating total negative linear correlation, 0 indicating no linear correlation and 1 indicating total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r.To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
Spearman's ρ
The Spearman's rank correlation coefficient (ρ) is a measure of monotonic correlation between two variables, and is therefore better in catching nonlinear monotonic correlations than Pearson's r. It's value lies between -1 and +1, -1 indicating total negative monotonic correlation, 0 indicating no monotonic correlation and 1 indicating total positive monotonic correlation.To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
Kendall's τ
Similarly to Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. It's value lies between -1 and +1, -1 indicating total negative correlation, 0 indicating no correlation and 1 indicating total positive correlation.To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is given by the number of concordant pairs minus the discordant pairs divided by the total number of pairs.
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently between categorical, ordinal and interval variables, captures non-linear dependency and reverts to the Pearson correlation coefficient in case of a bivariate normal input distribution. There is extensive documentation available here.Cramér's V (φc)
Cramér's V is an association measure for nominal random variables. The coefficient ranges from 0 to 1, with 0 indicating independence and 1 indicating perfect association. The empirical estimators used for Cramér's V have been proved to be biased, even for large samples. We use a bias-corrected measure that has been proposed by Bergsma in 2013 that can be found here.First rows
| id | cat0 | cat1 | cat2 | cat3 | cat4 | cat5 | cat6 | cat7 | cat8 | cat9 | cont0 | cont1 | cont2 | cont3 | cont4 | cont5 | cont6 | cont7 | cont8 | cont9 | cont10 | cont11 | cont12 | cont13 | target | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | A | B | A | A | B | D | A | E | C | I | 0.923191 | 0.684968 | 0.124454 | 0.217886 | 0.281421 | 0.881122 | 0.421650 | 0.741413 | 0.895799 | 0.802461 | 0.724417 | 0.701915 | 0.877618 | 0.719903 | 6.994023 |
| 1 | 2 | B | A | A | A | B | B | A | E | A | F | 0.437627 | 0.014213 | 0.357438 | 0.846127 | 0.282354 | 0.440011 | 0.346230 | 0.278495 | 0.593413 | 0.546056 | 0.613252 | 0.741289 | 0.326679 | 0.808464 | 8.071256 |
| 2 | 3 | A | A | A | C | B | D | A | B | C | N | 0.732209 | 0.760122 | 0.454644 | 0.812990 | 0.293756 | 0.914155 | 0.369602 | 0.832564 | 0.865620 | 0.825251 | 0.264104 | 0.695561 | 0.869133 | 0.828352 | 5.760456 |
| 3 | 4 | A | A | A | C | B | D | A | E | G | K | 0.705142 | 0.771678 | 0.153735 | 0.732893 | 0.769785 | 0.934138 | 0.578930 | 0.407313 | 0.868099 | 0.794402 | 0.494269 | 0.698125 | 0.809799 | 0.614766 | 7.806457 |
| 4 | 6 | A | B | A | A | B | B | A | E | C | F | 0.486063 | 0.639349 | 0.496212 | 0.354186 | 0.279105 | 0.382600 | 0.705940 | 0.325193 | 0.440967 | 0.462146 | 0.724447 | 0.683073 | 0.343457 | 0.297743 | 6.868974 |
| 5 | 7 | A | A | A | C | B | B | A | E | E | B | 0.319723 | 0.741507 | 0.648946 | 0.434844 | 0.280691 | 0.245560 | 0.217362 | 0.606298 | 0.345282 | 0.351235 | 0.371940 | 0.222782 | 0.279227 | 0.773600 | 7.060652 |
| 6 | 8 | A | B | A | C | B | D | A | E | G | F | 0.867227 | 0.067449 | 0.199145 | 0.840979 | 0.278068 | 0.931266 | 0.633157 | 0.784185 | 0.912704 | 0.801154 | 0.599786 | 0.901656 | 0.837474 | 0.674477 | 6.165491 |
| 7 | 9 | B | A | B | C | B | B | A | E | C | L | 0.348204 | 0.619819 | 0.550846 | 0.182465 | 0.784294 | 0.363130 | 0.349324 | 0.252734 | 0.474170 | 0.434062 | 0.729585 | 0.575455 | 0.695125 | 0.224310 | 8.100110 |
| 8 | 10 | A | B | B | A | B | D | A | E | E | F | 0.920995 | 0.486993 | 0.244655 | 0.196558 | 0.278993 | 0.949677 | 0.532924 | 0.722310 | 0.924495 | 0.812624 | 0.594173 | 0.884272 | 0.816702 | 0.777538 | 8.180236 |
| 9 | 11 | A | B | A | C | B | D | A | E | E | G | 0.853132 | 0.353198 | 0.189864 | 0.249299 | 0.281267 | 0.825722 | 0.626083 | 0.310801 | 0.944050 | 0.491695 | 0.562309 | 0.555027 | 0.615598 | 0.484117 | 6.589764 |
Last rows
| id | cat0 | cat1 | cat2 | cat3 | cat4 | cat5 | cat6 | cat7 | cat8 | cat9 | cont0 | cont1 | cont2 | cont3 | cont4 | cont5 | cont6 | cont7 | cont8 | cont9 | cont10 | cont11 | cont12 | cont13 | target | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 299990 | 499980 | A | A | A | C | B | D | A | D | G | F | 0.899401 | 0.762942 | 0.114374 | 0.700306 | 0.279428 | 0.870174 | 0.776437 | 0.717039 | 0.927952 | 0.846331 | 0.902140 | 0.898102 | 0.873316 | 0.782536 | 6.199239 |
| 299991 | 499985 | A | A | A | D | B | B | A | E | C | K | 0.442036 | 0.418143 | 0.324973 | 0.276804 | 0.520999 | 0.338526 | 0.838508 | 0.372358 | 0.404663 | 0.722228 | 0.425598 | 0.850828 | 0.365547 | 0.697558 | 7.892157 |
| 299992 | 499988 | A | B | A | C | B | B | A | E | E | L | 0.392158 | 0.058307 | 0.498862 | 0.416967 | 0.424167 | 0.264094 | 0.322022 | 0.254636 | 0.310304 | 0.314588 | 0.244861 | 0.257667 | 0.331081 | 0.795666 | 8.199213 |
| 299993 | 499989 | A | A | A | A | B | D | A | D | A | F | 0.524388 | 0.298947 | 0.504207 | 0.281260 | 0.556754 | 0.783050 | 0.691795 | 0.694392 | 0.494858 | 0.522301 | 0.788228 | 0.851542 | 0.891290 | 0.733786 | 8.365215 |
| 299994 | 499992 | A | A | A | A | B | D | A | E | C | F | 0.474155 | 0.744986 | 0.491379 | 0.176338 | 0.584547 | 0.730974 | 0.578603 | 0.329631 | 0.503306 | 0.555314 | 0.713684 | 0.428304 | 0.647398 | 0.801595 | 6.783325 |
| 299995 | 499993 | A | B | A | C | B | B | A | E | E | L | 0.260716 | 0.712438 | 0.161661 | 0.442794 | 0.768447 | 0.269578 | 0.258655 | 0.363598 | 0.300619 | 0.340516 | 0.235711 | 0.383477 | 0.215227 | 0.793630 | 8.343538 |
| 299996 | 499996 | A | B | A | C | B | B | A | E | E | L | 0.173302 | 0.121591 | 0.592514 | 0.193711 | 0.775951 | 0.197211 | 0.257024 | 0.574304 | 0.227035 | 0.322583 | 0.286094 | 0.324874 | 0.306933 | 0.230902 | 7.851861 |
| 299997 | 499997 | A | B | A | C | B | B | A | E | C | M | 0.342856 | 0.617869 | 0.462991 | 0.418098 | 0.297406 | 0.449482 | 0.386172 | 0.476217 | 0.135947 | 0.502730 | 0.235788 | 0.316671 | 0.250286 | 0.349041 | 7.600558 |
| 299998 | 499998 | A | B | B | C | B | B | A | D | E | F | 0.599403 | 0.686054 | 0.660860 | 0.187199 | 0.758642 | 0.363130 | 0.324132 | 0.229017 | 0.220888 | 0.515304 | 0.389391 | 0.245234 | 0.303895 | 0.481138 | 8.272095 |
| 299999 | 499999 | A | A | B | A | B | D | A | E | C | K | 0.475451 | 0.037659 | 0.753772 | 0.398129 | 0.696047 | 0.734712 | 0.404145 | 0.497719 | 0.497974 | 0.782585 | 0.751251 | 0.608412 | 0.712868 | 0.452400 | 6.025685 |